Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers
Abstract
Dysarthria is a motor speech disorder characterized by weakness, paralysis, or poor coordination of the muscles responsible for speech. Although automatic speech recognition (ASR) systems have been developed for disordered speech, factors such as low intelligibility and limited phonemic repertoire decrease speech recognition accuracy, making conventional speaker adaptation algorithms perform poorly on dysarthric speakers. In this work, rather than adapting the acoustic models, we model the errors made by the speaker and attempt to correct them. For this task, two techniques have been developed: (1) a set of “metamodels” that incorporate a model of the speaker’s phonetic confusion matrix into the ASR process; (2) a cascade of weighted finite-state transducers at the confusion matrix, word, and language levels. Both techniques attempt to correct the errors made at the phonetic level and make use of a language model to find the best estimate of the correct word sequence. Our experiments show that both techniques outperform standard adaptation techniques.
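The idea of combining a speaker's phonetic confusion matrix with a language model can be illustrated with a minimal noisy-channel sketch. This is not the paper's metamodel or weighted finite-state transducer implementation: it assumes one-to-one phoneme substitutions only, a unigram word prior in place of a full language model, and isolated words. The phoneme symbols, probabilities, lexicon, and function names are all hypothetical and chosen purely for illustration.

```python
import math

# Hypothetical toy data; none of these values come from the paper.
# CONFUSION[intended][observed] = P(observed phoneme | intended phoneme)
CONFUSION = {
    "b": {"b": 0.6, "p": 0.4},
    "p": {"p": 0.9, "b": 0.1},
    "ae": {"ae": 1.0},
    "t": {"t": 0.7, "d": 0.3},
    "d": {"d": 0.8, "t": 0.2},
}

# Hypothetical pronunciation lexicon and unigram word prior
# (the prior stands in for a language model).
LEXICON = {
    "bat": ["b", "ae", "t"],
    "pat": ["p", "ae", "t"],
    "bad": ["b", "ae", "d"],
    "pad": ["p", "ae", "d"],
}
WORD_PRIOR = {"bat": 0.4, "pat": 0.1, "bad": 0.4, "pad": 0.1}


def channel_log_prob(intended, observed):
    """Log P(observed | intended), assuming substitutions only."""
    if len(intended) != len(observed):
        return -math.inf  # insertions and deletions are not modelled here
    total = 0.0
    for i, o in zip(intended, observed):
        p = CONFUSION.get(i, {}).get(o, 0.0)
        if p == 0.0:
            return -math.inf
        total += math.log(p)
    return total


def correct_word(observed):
    """Return the lexicon word maximizing log prior + log channel score."""
    best_word, best_score = None, -math.inf
    for word, phones in LEXICON.items():
        score = math.log(WORD_PRIOR[word]) + channel_log_prob(phones, observed)
        if score > best_score:
            best_word, best_score = word, score
    return best_word


print(correct_word(["p", "ae", "d"]))  # bad
```

With these toy numbers, the recognized phonemes spell "pad", but the speaker often produces "p" when intending "b" and "bad" has a higher prior, so the combined score favours "bad". A sequence that matches no lexicon entry under this substitution-only model returns `None`.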
Related articles
Maximum Likelihood Linear Regression (MLLR) for ASR Severity Based Adaptation to Help Dysarthric Speakers
Automatic speech recognition (ASR) for dysarthric speakers is one of the most challenging research areas. The lack of corpus for dysarthric speakers makes it even more difficult. The speaker adaptation (SA) is an alternative solution to overcome the lack of dysarthric speech and enhance the performance of ASR. This paper introduces the Severity-based adaptation, using small amount of speech dat...
Vocal tract representation in the recognition of cerebral palsied speech.
PURPOSE In this study, the authors explored articulatory information as a means of improving the recognition of dysarthric speech by machine. METHOD Data were derived chiefly from the TORGO database of dysarthric articulation (Rudzicz, Namasivayam, & Wolff, 2011) in which motions of various points in the vocal tract are measured during speech. In the 1st experiment, the authors provided a bas...
Automatic recognition of dutch dysarthric speech: a pilot study
This paper describes a feasibility study into automatic recognition of Dutch dysarthric speech. Recognition experiments with speaker independent and speaker dependent models are compared, for tasks with different perplexities. The results show that speaker dependent speech recognition for dysarthric speakers is very well possible, even for higher perplexity tasks.
Difficulties in Automatic Speech Recognition of Dysarthric Speakers and the Implications for Speech-Based Applications Used by the Elderly: A Literature Review
Automatic speech recognition is being used in a variety of assistive contexts, including home computer systems, mobile telephones, and various public and private telephony services. Despite their growing presence, commercial speech recognition technologies are still not easily employed by individuals who have speech or communication disorders. While speech disorders in older adults are common, ...
Estimation of Phoneme-Specific HMM Topologies for the Automatic Recognition of Dysarthric Speech
Dysarthria is a frequently occurring motor speech disorder which can be caused by neurological trauma, cerebral palsy, or degenerative neurological diseases. Because dysarthria affects phonation, articulation, and prosody, spoken communication of dysarthric speakers gets seriously restricted, affecting their quality of life and confidence. Assistive technology has led to the development of spee...
Journal:
- EURASIP J. Adv. Sig. Proc.
Volume 2009 (issue not given)
Pages: not given
Publication date: 2009